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Abstract 

Several recent works point out that the crowd of smaU unobservable earthquakes (with magnitudes 
below the detection threshold md) may play a significant and perhaps dominant role in triggering future 
seismicity. Using the ETAS branching model of triggered seismicity, we apply the formalism of generating 
probability functions to investigate how the statistical properties of observable earthquakes differ from 
the statistics of all events. The ETAS (cpidcmic-typc aftershock sequence) model assumes that each 
earthquake can trigger other earthquakes ("aftershocks"). An aftershock sequence results in this model 
from the cascade of aftershocks of each past earthquake. The triggering efficiency of earthquakes is 
assumed to vanish below a lower magnitude limit uiq, in order to ensure the convergence of the theory and 
may refiect the physics of state-and-velocity frictional rupture. We show that, to a good approximation, 
the ETAS model is renormalized onto itself under what amounts to a decimation procedure niQ rrid, 
with just a renormalization of the branching ratio from n to an effective value n{md)- Our present 
analysis thus confirms, for the full statistical properties, the results obtained previously by one of us and 
Werner, based solely on the average seismic rates (the first-order moment of the statistics). However, our 
analysis also demonstrates that this renormalization is not exact, as there are small corrections which can 
be systematically calculated, in terms of additional contributions that can be mapped onto a different 
branching model (a new relevant direction in the language of the renormalization group) . 
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I. INTRODUCTION 



In the last few years, physicists' interest for the space-time organization of seismicity in different 
regions of the world has spurred. This recent burst of attention is probably due to the introduction 
of new diagnostic tools applied to earthquake catalogs [1-19] and to improved insights from cartoon 
models of earthquakes [20-25]. The first class of papers in particular suggest to re-examine the 
standard statistical properties of earthquakes, usually documented under the following distinct 
power law and fractal properties: (i) the Gutenberg- Richter distribution ~ (with (3 ^ 2/3) 

of earthquake energies E [26]; (ii) the Omori law ~ (with p ^ 1 for large earthquakes) of 
the rate of aftershocks as a function of time t since a mainshock [27]; (iii) the productivity law 
~ E"- (with a ~ 2/3) giving the number of earthquakes triggered by an event of energy E [28]; 
(iv) the power law distribution ~ 1/L^ of fault lengths L [29]; (v) the fractal structure of fault 
networks [30] and of the spatial organization of earthquake epicenters [31]; (vi) the distribution 
1 / s'^'^^ (with 5 > 0) of seismic stress sources s in earthquake focal zones due to past earthquakes 
[32]. Specifically, the statistical analysis based on (a) coarse-grained scaling ansatz [1, 2, 7-9, 13] 
(b) entropic methods [4, 15], and (c) network methods [6, 10-12] suggest that the above standard 
seismological description [26-32] may be inadequate. It is not clear however what should be the 
correct physical model. Several papers have however questioned the novelty of the insights derived 
from these approaches [2, 5, 16]. 

The present authors are among those who have studied how the standard seismological laws 
[26-32] (in particular the laws (i)-(iii) mentioned above) could actually go a long way towards 
explaining most of the empirical phenomenology of seismicity, including the supposed anomalous 
or "novel" scaling laws proposed by the above quoted physicists (see for instance [33-38]). In this 
series of papers, we have developed a consistent statistical description of seismicity using models 
of triggered seismicity, which allows one to make quantitative predictions of observables that can 
be compared with empirical data. The simplest class of models of triggered seismicity combines 
the above mentioned Gutenberg- Richter (i), Omori (ii), and productivity laws (iii) which can be 
applied to a fractal spatial geometry of earthquake epicenters (v) [38]. The fundamental physical 
ingredient is that each earthquake can trigger other earthquakes and an earthquake sequence 
results in this model from the cascade of events triggered by past earthquakes. The usual notions 
of foreshocks, mainshocks and aftershocks lose their specificity as any earthquake can be triggered 
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by previous earthquakes and may trigger itself subsequent earthquakes. Here, we continue our 
study of the general branching process, called the Epidemic- Type Aftershock Sequence (ETAS) 
model of triggered seismicity, introduced by Ogata in the present form [39] and by Kagan and 
Knopoff in a slightly different form [40] and whose main average statistical properties are reviewed 
in [41]. This model has been shown to constitute a powerful null hypothesis to test against other 
models [39]. The ETAS model belongs to a general class of branching processes [42, 43]. It can 
be viewed as the monofractal approximation of the more general multifractal model of triggered 
seismicity introduced recently in [24, 25], which derives from the physics of thermally activated 
rupture aided by stress. 

The physical problem addressed here is the following. We start from the empirical evidence 
[28, 44] that small earthquakes dominate or are at least equivalent collectively to large earthquakes 
in triggering other earthquakes. This can be seen by combining the Gutenberg-Richter law (i) 
oc 10^''™ and the productivity law (iii) oc 10"™ to obtain the typical number oc 10^(''~")™ of events 
triggered by earthquakes of magnitude between m and m + 1. With the empirical estimates of 6 ~ 1 
and 0.8 < q; < 1 together with the observation that triggered events seem to have magnitudes with 
only weak or no relation with the magnitude of the triggering event (magnitude-independence law) 
[44] (i.e., large earthquakes can be triggered by small events), this implies the perhaps surprising 
conclusion that large earthquakes are triggered more by the swarm of small previous earthquakes 
than by preceding large earthquakes. This stems from the observation that the number of small 
earthquakes increases faster as their magnitude decrease than their productivity decreases. The 
conclusion that small earthquakes dominate triggering is thus intrinsically a collective effect. This 
picture, which emphasizes the collective organization of earthquakes or "many-body" view, can be 
contrasted with the "one-body" or few-body approach of R. Stein and co-workers [45, 46] which 
focuses exclusively on how a few large earthquakes can promote subsequent shocks at some sites 
and inhibit them in others. If indeed the small earthquakes dominate in the triggering of future 
events, this begs to define how small "small" can be, since the smaller the earthquakes the larger 
their triggering influence. The evidence that small earthquakes should dominate triggering is based 
on the empirical statistics (i) and (iii) established for event magnitudes above magnitude 2 or 3 
(depending on the completeness of the studied catalogs). The question of how small "small" 
is amounts to asking how far in the small magnitude range can the productivity law and the 
magnitude-independence law be extrapolated. Because the Gutenberg-Richter law (i) has been 
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observed at such small scales as individual dislocation motions, we know for sure that there must be 
a lower "ultra-violet" cut-off magnitude mo at which the productivity of events of magnitude smaller 
than mo tapers off or vanishes. Otherwise, the factor oc 10"^''""^™ would diverges as m — > — oo 
(energy goes to zero) . Is the ultra-violet cut-off associated with an atomic scale for rupture? Or are 
other relevant scales? This question has been addressed in two recent papers by M. Werner and one 
of us [47, 48] within the framework of the ETAS model. Consider a catalog complete for magnitudes 
above some observational threshold m^, i.e., all earthquakes with magnitudes m > rud have been 
recorded but smaller earthquakes are not. Noting that the magnitude m^ of completeness of a 
seismic catalog is not in general the same as the magnitude mo of the smallest triggering earthquake, 
Ref. [47] showed that bounds for mo can be obtained from quantitative fits to observed aftershock 
sequences. In addition, Ref. [48] remarked that, in models of triggered seismicity and in their 
estimation from empirical data, the detection threshold m^ is commonly equated to the magnitude 
mo of the smallest triggering earthquake. This unjustified assumption neglects the possibility of 
shocks below the detection threshold triggering observable events, a process which should dominate 
according to our previous discussion. Ref. [48] developed a mean field formalism within the ETAS 
model: by considering the branching structure of one complete cascade of triggered events, the 
catalog of observed events with magnitude above m^ was shown to be described by an effective 
"renormalized" ETAS model with its lower magnitude cut-off equal to m^ but with an apparent 
branching ratio (which is the apparent fraction of aftershocks in a given catalog) and an apparent 
background source Sa, due to the presence of smaller undetected events capable of triggering larger 
events. This result is potentially very important since it implies that previous estimates of the 
clustering characteristics of seismicity may significantly underestimate the true values: for instance, 
an observed fraction of 55% of aftershocks is renormahzed into a true value of 75% of triggered 
events. 

The object of the present paper is to extend the previous mean field treatment to obtain the 
full earthquake statistics using the formalism of generating probability function (GPF) already 
developed in [35-38]. In a sense, the question addressed here is whether the ETAS model can 
be renormalized onto itself by moving mo to m^ > mo (which can be seen as a coarse-graining 
operation), that is, is there an effective ETAS model with minimum magnitude nid and with renor- 
malized parameters, which describes the observed catalogs? Beyond its interest and application 
to earthquakes, this problem is relevant to a general understanding of coarse-grained properties of 
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marked branching processes, to which our formahsm appUes. 

The organization of the paper is the following. Section II A introduces the general formulation 
of observable clusters of triggered events using the generating probability functions (GPF). It also 
presents a simple intuitive approximation which will be make rigorous in later sections. Section 
II B derives general relations for the effective branching rates of observable events. Section II C 
defines the ETAS model and recalls its main useful properties. Section II D introduces the GPF 
for unobscrvable and observable aftershocks. Section II E gives the main properties of the effective 
branching rates, which recover the previous analysis of [48] in a slightly different form. Section 
IIP explains that the present approach and that of [48] are equivalent physically but with a 
different mathematical formulation. The justification for introducing a physically equivalent but 
mathematically different formulation here is that it is more adapted to the calculations of the full 
statistics with the GPP formalism. Section III presents all our results on the statistics of observable 
events in the ETAS branching model. Section III A derives the general equation governing the GPF 
of observable events. Section III B use the derivation of the previous section to give quantitative 
estimates for the fraction of observable events. Section IIIC discusses the approximation of self- 
similarity, corresponding to a renormalization of the ETAS model onto itself by the change from mo 
to nid. This self-similarity amounts to say that the statistics of observable events can be deduced 
entirely from the statistics of all events under a simple renormalization of the average branching 
ratio into an effective value. Section III D derives the implications of the self-similar approximation 
for the distribution of the numbers of observable events. Section III E discusses the deviations from 
self-similarity and identifies a correction in the form of a new branching model, which gives rather 
small corrections to the previously self-similar estimates. The last section concludes. 

II. DEFINITION AND PROPERTIES OF EFFECTIVE RATE OF OBSERVABLE AF- 
TERSHOCKS 

A. General formulation of observable clusters 

In this section, we present the general formulation of generating probability function (GPF) for 
marked branching processes with an observational constraint. Recall that, for general branching 
processes such as the ETAS model, the GPF formalism allows one to calculate the full statistical 
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properties. Here, the mark associated with an event is its magnitude. The observation constraint 
is that only events with magnitude m > md, where is the observation threshold, are known, 
while the process produces events which can have a lower magnitude, down to a lower triggering 
cut-off mo- 

To get a first feeling of how the observational constraint can be taken into account in the 
GPF formalism, consider the case of a large finite time window of size r in which we count the 
number of events and let us use the approach developed in [36] for the statistics of the number of 
such windowed events. The time windows are considered large if their size is significantly larger 
than the typical hfe-time of the clusters, defined as the sequence of aftershocks, triggered by single 
background event (see [36] for a discussion). In this hmit, the statistics of the total (observable and 
unobservable) number of events in a window of size r is obtained from the generating probability 
function (GPF) 0^(2;; r), which obeys the following equation 

e^(^;T) = e'^^[®(^)-i] . (1) 

Q{z) is the GPF of the number of all the aftershocks triggered by a given source, including the 
source event itself and u is the Poisson intensity of the background sources. We have shown [35] 
that 6(^) has the structure 

e(^) = zG{z) , (2) 

where the factor z to the left of G{z) takes into account the contribution of the background source, 
while the GPF G{z) describes the statistics of the number of all aftershocks within a given cluster 
In view of (2), the GPF for finite time windows given by (1) is the natural generalization of the 
GPF obtained for the Poissonian background events: 

efe(z;T) = e'^^(^-i) . (3) 

Now, the statistics of observable events requires to replace the GPF 6(^) in (1) by the GPF 
Q{z; md) of the number of aftershocks (and their sources) whose magnitudes m are larger than the 
detection threshold to obtain 

e^{z] md, r) = e'^-[©(^'"^'i)-il . (4) 

Note that some clusters might have no observable events at all. This means that there is a non-zero 
probability 

p{md) = e{z = 0;md)y^O (5) 
7 



that the cluster is completely unobservable. The complementary probability 



q{md) = 1 - p{md) = 1 - 6(0; ma) (6) 

is the probabihty that there is at least one observable event (source or some aftershock) in the 
cluster under inspection. In what follows, we refer to a cluster as "observable," if it contains at 
least one observable event (with magnitude m > rrifi). Accordingly, q{md) defined in (6) is the 
probability that a cluster is observable; it is also the fraction of observable clusters. 
It is convenient to express the GPF ©(2;; rrid) in the form 

Q{z] rud) = q{md)Q{z; rud) + I - q{md) , (7) 

where 

Q{z;md)^^^[Q{z;md)-e{^]md)] (8) 

is nothing but the conditional GPF of the number of observable events within observable clusters. 
This definition implies that it has the same structure 

Q{z : rud) = zG{z;md) , (9) 

as that given by (2) of the GPF Q{z) of the total number of events belonging to some cluster. 
Expression (9) implies that one can treat the observable event which comes first in time as the 
"observable source," and then interpret G{z; nid) as the GPF of its observable aftershocks. 

Substituting (7) into (4) obtains the following representation for the GPF of the number of 
observable windowed events 

Q^{z] rud, t) = e^('"<^)^l®("'"*<^)-^l , (10) 

where 

io{md) = ioq{md) (11) 

is the renormalized intensity of "observable sources," which is the same as the intensity of observable 
clusters by definition. 

There is a physically transparent way to estimate the probability q{md) that a cluster is observ- 
able. It is indeed always possible to represent q{md) in the form 

q{md) = q^{md)Q{md) + q~{md)[l - Q{md)] . (12) 
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Here, q^{md) (respectively q~{md)) is the probability that the cluster is observable under the 
condition that its generating source is also observable (respectively unobservable) . In addition, 
Q{md) is the probability that the source is observable. Obviously, we have 

q^irud) = 1 , q^ima) = l-p~{md) , (13) 

where p~{md) is the probability that all aftershocks triggered by an unobservable event are unob- 
servable. To estimate p~ (m^) , we make the assumption that each unobservable event either triggers 
only one first-generation aftershock, with probability //(m^), or does not trigger any aftershocks 
with the probability 1 — z/(m^). This approximation is quite reasonable, as can be seen from the 
application of the productivity law E°' -^^am, (^^j^j^ q ~ 2/3, a ~ 1): if an event of magnitude 
m — 7 produces about 10^ observable events on average, an event of magnitude 2 triggers about 0.1 
events on average. In this example, i^(md) ~ 0.1 and 1 — ^{md) ~ 0.9 and the error in neglecting 
the possibihty for this event to trigger two aftershocks is of order 0.01. This error becomes even 
smaller for smaller unobservable sources. 

Suppose additionally that the magnitudes of the triggered aftershocks are statistically indepen- 
dent of each other. Then, the probability that all aftershocks, triggered by an unobservable event, 
are unobservable, is given by 

oo 

p-{md) ~ E[l - y{md)]u\md)[l - Q{md)f , (14) 

it=0 

where — are the geometrical probability that an unobservable event triggers k aftershocks, 
while (1 — QY is the probability that they are all unobservable. After summation, we obtain 

Substituting this expression into (13) and then (13) into (12), we obtain the probability that a 
cluster is observable under the form 

1 - i/(mrf)[l - Q{md)\ 

In the following, we obtain with the framework of the ETAS model, a physically transparent 
relation for the probability i^(md), which will allow us to obtain an accurate estimation of the 
probability q{md) given by (16) and the corresponding renormalized intensity uj{md) of "observable 
sources" given by (11). Specifically, the role of ^{md) is derived in expression (37) below for the 

GPF of first-generation aftershocks triggered by unobservable event. 
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B. Effective observable aftershocks rate 



Before turning to the specifics of the ETAS model and its statistics of observable events, it is 
useful to discuss the properties of the average rate of general branching processes. The results 
obtained in this section recover those obtained in [48] within a slighly different interpretation, that 
we present to be self-contained and to connect with the subsequent derivation of the full number 
statistics in following sections. 

It is well-known that the key parameter controlling the properties of cascades of triggered events 
is the branching rate n, which is nothing but the average number of first generation aftershocks, 
where the average is performed over all possible triggering event of arbitrary magnitude. The cases 
n < 1, n — 1 and n > 1 correspond respectively to the sub-critical, critical and super-critical 
regimes, with the first-two giving stationary time series in the presence of a stationary immigration 
of sources and the later giving explosive time series with a positive probability [41-43]. 

In branching processes (of which the ETAS model is an example), we can use the representation 
that each shock triggers independently its own aftershocks sequence (see [48] for a discussion on 
the two interpretations in terms of decoupled branches used here or of collective triggering; the two 
views are equivalent due to the linear sum over past events and the conditional Poisson process 
formulation) . The independence between different branches allows us to obtain the average (R) of 
the total number of events (mainshock itself and all its offsprings over all geneations) triggered by 
an arbitrary mainshock as [49] 

(i?) = 1 + n + + • • • = , (17) 

1 — n 

where is the contribution of the k-th generation of aftershocks. Thus, if the average number 
(R) of events per cluster is known, the aftershocks rate can then be obtained as 

n=^. (18) 

This simple remark will be useful in the following to derive an apparent or renormalized branching 
ratio n(md) and test its usefulness to describe the full number statistics. 

The average number of observable events, which are triggered by some arbitrary source, is simply 
n given by (18) multiplied by the probability Qinifi) that an event is observable: 

/■oo 

Q{md) = / P{m)dm , (19) 
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where P{m) is the probabihty density function (PDF) of their random magnitudes (assumed to be 
the same for sources and all aftershocks). This gives 

{R){m,) = {R)Q{m,) = ^^ . (20) 

1 — n 

Consider now the conditional average (R) {md) of the number of observable events within some 
observable cluster. It is simply given by 

(^)(m.) = 4^ = ^^^^. (21) 

This quantity (i?) {rrid) can not be derived as straightforwardly as the average number {R) over all 
events obtained with (17). Indeed, an observable cluster may result from an effective "observable 
source" which might belong, for instance, to the 3-th or even 7-th generation of the total after- 
shock sequence. Moreover, it seems impossible to classify uniquely observable events of observable 
clusters as belonging to observable aftershocks of first, second or /c-th generations. Therefore, it 
is not possible to use for {R){md) the reasonings underlying relation (17). Notwithstanding this 
limitation, we can introduce an effective branching ratio for observed clusters, based on a natural 
extension of relation (18). Let us thus define the effective branching ratio of observable clusters as 

= . (22) 

{R}{md) 



With (21), this gives 



Substituting in (23) the r.h.s. of equality (16) yields 

^ n - uimd)[l - Q{md)] 

1-Km.)[l-Q(m,)] ' ^'^^ 

expressing the effective average aftershock rate via the probability Qinid) that a background event 
is observable and the probability v{rnd) that an unobservable background event triggers some 
first-generation aftershock. 

C. Basic properties of the ETAS model 

To make further progress and in particular to calculate the probability qirud) given by (16) that 
a cluster is observable and to obtain the effective branching rate n{md) given by (24), we need 
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to specify the properties of ETAS branching modeL The ETAS model is defined by the following 
rules. Each event of magnitude m triggers a Poissonian sequence of aftershocks characterized by 
the Poissonian GPF [35] 

Gi{x;m\K)^e''^'^"'^^'-^\ (25) 

where Kfi^m) is the average number of first generation aftershocks triggered by a mainshock of 
magnitude m, k is a numerical constant and /x(m) describes the so-called productivity law, i.e., the 
dependence of the number of first generation aftershocks number on the mainshock magnitude m. 
Previous empirical studies have established that the productivity law is approximately exponential 
[28, 44]: 

/i(m) = 10"("*-"^o) . (26) 

with an exponent a in the range 0.8 — 1. Here, mo is the lower magnitude threshold below which 
events are supposed not to be able to trigger any aftershock. The ETAS model also uses the 
well-known Gutenberg-Richter (GR) law for the PDF of earthquake magnitudes 

P(m) = 61n(10)10-''("*-"^°) , with b^l , (27) 

which are assumed to be independently drawn at each event occurrence. Averaging the GPF 
defined by (25) over all possible random source magnitudes m weighted by the GR distribution 
(27), we obtain the GPF of first generation aftershock numbers triggered by an arbitrary source: 

Gi(z\k) ^ F[k(1 - z)] , (28) 

where 



F{x) = 7 e-^^y^^ = 7Z/^r(-7, y) , 7 = - , (29) 



and r(— 7,1/) is the incomplete Gamma function. In the sequel, we shall use the following power 
law expansion of the function F{x) 



F{x)c^l-^^ x + (3x\ /5 = 7r(-7) = Q^— ^. (30) 
7 — 1 7 — 1 

Thus, for 7 — 1+, both coefficients of x and x'^ grow together. 

Our previous calculations have shown that this expansion is very accurate for 7 < 1.25, which 
is the relevant range [35-38] . 
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This expansion (30) allows us to express the main properties of the statistics of the number 
of aftershocks. For this, let us substitute (30) into (28) to obtain the corresponding approximate 
expression for the GPF of the number of first generation aftershocks 

Gi{z\k) c^l-n{l- z) + (3k"'{1- zy , (31) 

where 

n = {R^) = ^ (32) 
7-1 

is the average aftershock branching ratio, i.e., the average of the total number of first gen- 
eration aftershocks triggered by an arbitrary source. Recall that the last term (3k"'{1 — z)"' in the 
r.h.s. of expression (31) expresses the property that the distribution Vi{r\K.) of the total number 
of first generation aftershocks triggered by an arbitrary source has a power law tail 

^dr\^) - % ■ (33) 

Expression (33) is the leading asymptotic of the exact expression 

1 d'-GAzlK 



Vi{r\K) 



= 7^r(r - 7, «) , (34) 
z=Q r! 



corresponding to the exact GPF (28) of the number of first generation aftershocks. Ref. [35] has 
shown that the power law (33) leads to a PDF of the total number of aftershocks of all generations 
which are triggered by an arbitrary source, which has a fatter tail ~ l/r^+^^Z'''^ close to criticality 
n pa 1 . 



D. Observable and unobservable aftershocks 

Let us now consider a different averaged GPF (25) obtained by using a truncated GR law 
constrained to unobservable earthquakes (with magnitudes m between mo and nid): 

61n(10)10"^('"""'°) 

P {m\md) = ^ _ ^ H{md - m)H{m - mo) , (35) 

where H{x) is the unit step (Heaviside) function and Q{md) — iQ-^'i'^d-mo) according to (19) and 
(27) is the probabihty for an earthquake to be observable. Averaging expression (25) over all 
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magnitudes weighted by P {m\md) given by (35) yields the GPF of the number of first-generation 
aftershocks triggered by an unobservable event: 

r-(,\, ^ ^ F[k{1 - z)] - Q{ma)F[Kii{ma){l - z)] 

G, {z\K, ma) 1 - Q^ma) ' ^^^^ 

Substituting the expansion (30) in (36) yields finally 

G^{z\k, ma)^l- v{ma){l - z) + 0[{l - zf\ , (37) 

where the coefficient ^{md} appears here from its definition as the probability that an unobservable 
background event triggers some aftershock. The expansion (37) at this linear order for the GPF 
of first-generation aftershocks triggered by unobservable event is equivalent to saying that an 
unobservable event can trigger at most a single aftershock, in agreement with the approximation 
used to obtain (15) and (16). 

Expressions (30) and (36) thus yield 

where 

p{md) = Q{md)n{md) = [Ai(md)]^"^ (39) 

describes the competition between the GR and productivity laws at the observational magnitude 
threshold m^. Multiplying (38) by the fraction 1 — Q{md) of unobservable sources yields the average 
number (Ri) of first generation aftershocks triggered by an unobservable source. (Ri) can be 
interpreted as the branching rate n~ {rud) of first-generation aftershocks triggered by unobservable 
sources: 

{Ri){md) = n'{md) = n[l - p{md)\ . (40) 

Note that the GPF (37) does not contain a term of the form ~ (1 — z)'^ as in (31), which was 
responsible for the power law tail (33) of the PDF of the total number of first generation aftershocks. 
As a consequence, the tail of the PDF of first-generation aftershocks triggered by unobservable 
sources is thinner than a power law. The power law tail (33) is simply due to the interplay between 
the productivity law (26) and the GR law (27) for the sources. Now, constraining the source 
magnitudes to be smaller than truncates the GR law and thus the PDF of the number of 
first-generation events. 
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Let us now turn to the statistics of first-generation aftershocks triggered by observable sources. 
The corresponding GPF is obtained by averaging (25) over all magnitudes weighted by the following 
modified GR law: 

P+(m; ma) = ^^H{m - ma) = bln{10)10-''^"''^H{m - ma) . (41) 
Q[md) 

This leads to 

Gt{z\K,md)^F[Kfx{md){l-z)] . (42) 

Note that expression (42) differ from the GPF (28) of the total number of first-generation events 
only through the renormalization 

K K{md) = KiJ,{md) . (43) 

This allows us to interpret the average number of first-generation aftershocks triggered by an 
arbitrary observable source as an effective branching rate n'^{md) equal to 

(R'l) = n+(m<i) = nn{md)Q{md) = np{md) , (44) 

where p{md) is defined by (39). Not surprisingly, the PDF of the number of first-generation 
aftershocks triggered by observable background events has a power law tail, 

1 dJ'Gi^zln, rrid) 



(45) 

2=0 



analogous to (33). 

These different results are summarized in Figure 1 which shows the PDF's Vi{r\K,'md) and 
Vi{r\K, rud) as a function of the number r of events obtained from the exact relation (34), for two 
different values of ii{md)- 



E. Properties of effective aftershock rates 

We are now armed to discuss in the framework of the ETAS model the properties of the prob- 
ability q{md) given by (16) for a cluster to be observable and the corresponding expression (24) 
for the effective branching rate n{md) of observable clusters. Note again that the expansion (37) 
at this linear order writes that an unobservable event can trigger at most a single aftershock, in 
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agreement with the approximation used to obtain (15) and (16). This entitles us to substitute (38) 
into (16) and (24) to obtain 



»('"^) = 1 - nil - MmM • 



and 

n[l - p{md)] ' 

where p{ma) is defined by (39). In the following sections, these two relations (46) and (47) will 
be derived from the exact equations obeyed by the GPF's of the number of aftershocks over all 
generations. In the mean time, let us discuss their properties and seismological implications. 

Using the notations (40) and (44), we can rewrite the effective rate (47) of observable clusters 
in the form 

"(m.) = T^i^ . (48) 

1-n (rrirf) 

and interpret it as the rate n^{md) of aftershocks triggered by observable sources, amplified by the 
impact of aftershocks triggered by unobservable sources since the denominator in (46) and (47) 
describes the influence of aftershocks triggered by unobservable sources. 

First, notice that, in critical case n = 1, we have n{rnd) = 1. Thus, the critical regime for all 
events is also critical for observable events. In this case, the probability qirrid) that a cluster is 
observable is given by 

q{md) = fJ.-\md) (n = 1) , (49) 

and decreases as the observation threshold increases, which parallels the intensity of effective 
observable sources given by (11). 

Two cases are worth discussing. For 

n-(md) =n[l-p(m<i)] < 1 , (50) 

which corresponds to a negligible productivity of unobservable events, then the impact of unob- 
served sources is small and 

q{md) ~ Q{md) , n{md) ~ n'^{md) = np{md) , (51) 

as if all aftershocks, which are triggered by observable sources, were observable. 
In contrast, for 

p{md) < 1 , (52) 
16 



we have 

^ ^ . u(m,) ^ . (63) 
1 — n 1 — n 

where the factor 1/(1 — n), quantifying the impact of clusters triggered by unobservable background 

events, becomes predominant. 

Expression (47) can be rewritten as 

= TTW^ ■ 

Thus, for 

mrf - mo < A* = logio (t—-) ' (^5) 



b-a °'^Vl-n. 

the effective aftershocks rate is critical: n{md) ^ n. For example, if n = 0.9, b — 1 and a — 0.8 we 
have A* ~ 4.77. Figure 2 (respectively 3) shows the dependence of the effective rate n{md) as a 
function of — mo (respectively n) for various n (respectively m^ — mo) . 



F. Correspondence between the present formalism and Sornette- Werner representation 
[48] 

At this point, the astute reader will have noticed that the expression (47) for the effective rate 
of observable events of first-generation is not the same as expression (10) of [48], which also gives 
an apparent branching ratio denoted for observable aftershocks of the first generation. Our 
present form (47) for n{md) departs from expression (10) of [48] for Ua via the denominator, that 
is, by the fact that n~ {rrid) defined in (50) is not zero. The two approaches arc actually equivalent 
as we now explain. Expression (10) of [48] defines an apparent branching rate n„ as only due 
to observable sources while n{md) given by (47) takes also into account the unobservable sources 
on observable aftershocks. In other words, n{md) given by (47) counts the effect of unobserved 
sources in the production of first generation events and thus describes the average number of first- 
generation daughters from unobservable aftershocks which are themselves "sources" for the future 
generations. In contrast, Sornette and Werner construct a representation in which the introduction 
of the observational cut-off rrid > mo not only renormalizes n into Ua given by their equation (10) 
but also introduces a renormalization of the spontaneous source rate [48] : for each real observable 
spontaneous sources, there are many apparent spontaneous sources which result from the fact that 
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an event triggered by an unobservable previous aftershock is considered a spontaneous source since 
one can not track its ancestor and can thus be counted as spontaneous. The two approaches can 
thus be summarized as follows: 

Sornette — Werner : 

{n, 1 spontaneous source} —>■ {n^ = np{md) , Sa — -/Vobs(^ ~ ^a) spontaneous sources} ,(56) 
where A^obs is the total number of observed aftershocks; 
present work : 

{n, 1 spontaneous source} — > {n(md) — — — -r , 1 spontaneous source} . (57) 

1 - n[l - p{md)\ 

Note that the fraction /„ = Sa/Nohs of apparent sources among all observed events given by 
expression (23) of [48] can be written 

fa = Sa/Nobs ^n-Ua^n- np{md) = n'~{md) , (58) 

where the last equality results from definition (50). This provides a physically intuitive interpre- 
tation of n~{md). Expression (47) can thus be written 

ft 

n{ma) = — ^ = n„(l + /„ + fl + ...) . (59) 

This formula clarifies completely the relationship between Sornette- Werner's formation and the 
present one: the first term Ha in the r.h.s. of (59) corresponds to the average number of daughters 
of first-generation due to an observable initial source; The second term Uafa corresponds to the 
average number of daughters of first-generation which are due to an apparent observable source 
which is triggered from a first-generation unobservable aftershock of the initial spontaneous source. 
The third term Uafl corresponds to the average number of daughters of first-generation which are 
due to an apparent source which was itself triggered by an apparent source of a first-generation 
unobserved aftershock of the initial spontaneous source; and so on... This reasoning demonstrates 
that the two formulations are physically equivalent, even though they have been obtained by 
different physical arguments. 



III. STATISTICAL DESCRIPTION OF OBSERVABLE EVENTS 



Until now, we have explored some properties of the fraction qirrid) of observable clusters and its 
corresponding effective observable aftershock rate n{md), using a physically transparent but non- 
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rigorous approach based on the properties of first-generation aftershocks triggered by observable 
and unobservable sources. In the following, we study the full statistical properties of observable 
events in large time window using the GPF's of the number of events of all generations, belong to 
an arbitrary cluster. 



A. Derivation of the GPF of observable events 



We start by the remark that the GPF of a single source of magnitude m, which takes into 
account the observability of the source, is equal to 



^Him-m,) _ 1 ^ _ im^rn - ma) 



1 , rrKuid , 

(60) 

z , m > rrid . 



Let us define G{z] m, md) as the GPF of the number of observable aftershocks of all generations 
which are triggered by a source (which can be observable or unobservable). Multiplying G{z] m, mii) 
by (60) yields the GPF Q{z;m,md) of the number of observable events triggered by a source of 
given magnitude m: 

e{z] m, ma) = z^^^^^Giz; m, ma) . (61) 

Averaging this expression over all possible source magnitudes weighted by the GR law (27) gives 
the GPF 

roo 

Q{z;'md) — / Q{z;m,'md)P{m)d'm (62) 

J mo 

of the total number of all observable events, which include all observable sources and all their 
observable aftershocks of all generations. 6(^;m(i) can be expressed as 

e(^; rud) = G{z] md\mo) + {z - l)G{z; md\md) (63) 

where 

roo 

G{z;m,d\x) — / G{z;'m,m,d)P{m)dm . (64) 

Jx 

Thus, determining the GPF Q{z;md) requires to calculate the GPF G{z;m,md) of the number 
of all observable aftershocks of all generations belonging to the same cluster. The later can be 
obtained by using the statistical independence of sources and aftershocks magnitudes, which leads 
to replacing z within the exponential of the r.h.s. of (25) by Q{z;md), which yields 

G{z; m, rrid) = e'^'^MIQ^^^"^'')-!] . (65) 
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Substituting (65) and (27) into (64) yields 

G{z; md\x) = Q{x)F{kii{x)[1 - e{z- m^)]) . (66) 

Using this expression (66) to express the terms in the r.h.s. of (63) leads to the following equation 
determining the sought GPF Q(z;md): 

e(z;md) = F(4l - e(z;ma)]) + (z - l)Q(ma)F(Kii{md)[l - e{z;ma)]) . (67) 

For rrid — mo {Q — /i — 1) such that all events can be observed, this equation reduces to the 
standard functional equation 

e{z\K) = zF(k[i - e(z\K)]) = zGi(e{z\K)\K) , (68) 

where Gi{z\k) is the GPF given by (28) of the number of first-generation aftershocks while Q{z\k) — 
Q{z; mo) is the GPF of the total number of event in a cluster. We make exphcit the dependence 
on the parameter k because it is going to play a crucial role in the following discussion. 
There is a physically natural partition of the GPF Q{z;md) given by (67) according to 

Q{z; md) = G'{z; nid) + zG^{z; nid) , (69) 

where 

G-(z;md) = F(k[1 - e(z;md)]) - Q(md)F(Kii(md)[l - e(z;md)]) , (70) 
describes the statistics of observable aftershocks triggered by an unobservable event, while 

G+{z;md) = Q{md)F{Ki^{md)[l - e{z;md)]) . (71) 

describes the statistics of observable aftershocks triggered by an observable event. 

There are a few exact consequences of relations (67)- (71) which can now be obtained. Consider 
the average number of events over all generations of a given cluster, given by definition by 

dQ{z;md) 



{R){md) 



dz 

Using equation (67), it is easy to show that it satisfies the equation 



(72) 

z=l 



(R) (md) = n{R) (nid) + Q{md) , (73) 
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whose solution (20) was already obtained from a direct probabilistic argument. By construction, 
{R){md) given by (20) is equal to the sum 

{R){ma) = Q{md) + (i?)+(m,) + {R)-{md) (74) 

of the contributions of observable events and aftershocks, which are triggered by both observable 
and unobservable events, with 



(75) 

z=l 



dz 

It follows from (70), (71) and (75) that 

{R)+{md) = np{md){R){md) = n+{md){R){md) , (76) 

and 

{R)-{md) = n[l - p{md)]{R){md) = n" (m^) (i?) (m^) . (77) 

These two relations confirm the physical meaning of the rates n^{md) defined in (40) and (44), 
which quantify the relative impact of aftershocks triggered by observable versus unobservable 
events. Expressions (74), (76) and (77) show that the rates n^{md) are the fractions of after- 
shocks of all generations which are triggered by observable ("*") versus unobservable (~) events. 

Figure 4 plots these two rates n'^{md) as a function of — ijiq for a — 0.8, h — 1, n — 0.9, 
showing that the impact of unobserved events may easily dominate for quite reasonable values of 
the model parameters. 



B. Fraction of observable clusters 



One of the key parameters governing the statistics of windowed observable events is the fraction 
q{md) defined by (6) of observable clusters. Equation (67) allows us to calculate it exactly. Indeed, 
it is easy to show that expression (67) implies that q{md) is solution of the equation 

q{md) = 1 - F[Kq{md)] + Q{md)F[Kii{md)q{md)] . (78) 

Noticing that Q{md) = [iJ,{'md)]~'^ = p~'^, we can rewrite (78) in the form 



"^[qirud)] = , 
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(79) 



where 

= 1 - X - F{kx) + h-^F{khx) . (80) 

A good approximate solution of (79) can be obtained by substituting the polynomial approximation 
(30) for F to obtain 

*(a;) ~/x-^-x[l-n(l-/x^-^)] . (81) 
The corresponding solution of 79) then reads 

Qirna) = ——^ — , (82) 

n/i + (1 — n)ii< 

which is equivalent to expression (46) derived above using an intuitive nonrigorous reasoning. In 
contrast, expression (82) and thus (46) is now obtained as an approximate solution of the exact 
equation (78). The accuracy of this approximation (82) (or (46)) can thus be checked by comparing 
it with the numerical solution of the exact equation (78). Correlatively, this also directly check 
the quality of expression (47). Figure 5 shows the ratio of the approximation (82) divided by the 
numerical solution of the exact equation (78), as a function of m^^ — mo for n = 0.9, 6=1 and three 
values of a = 0.7, 0.8, 0.9. One can observe that the quality of the approximation (82) improves as 
a gets closer to 1. 

C. Self-similarity of the statistics of observable events 

We now have the tools to calculate the conditional GPF Q(z;md) defined by (8) of the total 
number of observable events within an observable cluster. Substituting (7) into (67) yields the 
equation for Q{z;md): 

<p{e;md) = zF{K{md){l - 6)) , (83) 

where 

K{md) = Kfi{md)q{md) , (84) 

and 

^{q{md){l-x)) 
<p{x;md)^ Q(^) ■ (85) 

Definition (80) and equation (79) imply that the following identities are true 

99(0;md)=0, ^{l;md) = l. (86) 
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Using (80) and the approximate expression (30), we obtain the hnear approximation 



ip{x;md) ~ (pi{x) , ipi{x) = X , (87) 

which is consistent with (86). Then, substituting (87) into (83) yields an approximate equation for 
the GPF 0(z;md): 

e{z;ma) ~ zF{nimd)[l - e(^;m<,)]) . (88) 

We can check the accuracy of the hnear approximation (87), and thus its consequence for 
Q{z; rrifi) given by (88) by comparing the hnear function (fi{x) — x of (87) with the exact one given 
by (83). Figure 6 shows the difference 

Ai{x; nid) ^ (fi{x;md) - X , (89) 

where (fi{x; nid)— is given by (83), as a function of the variable x, for n — 0.9, 7 = 1.25 and 7 = 1.1 
and several values of nid — mo. This figure confirms the good accuracy of the linear approximation. 
The two following subsections will extract the consequence of this formulation for the distribution 
of aftershock numbers and will quantify the impact of the next order correction to the linear 
approximation (87). 

Note that equation (88) coincides, after the application of the renormalization (43) where n{md) 
is now given by expression (84), with the equation (68) for the GPF Q{z\k) of the total number 
of events within an arbitrary cluster. This has an important consequence for the physical under- 
standing of seismicity according to the ETAS model: as long as the linear approximation (87) is 
applicable, the statistics of the number of observable events within observable clusters is identical, 
up to the renormalization (43), to the statistics of the total number of events within an arbitrary 
cluster in which all events can be observed. 

This can be restated as the following self-similar property for the statistical properties of ob- 
servable clusters: 

Q{z-md)^Q{z\K{md)) . (90) 

This self-similarity property means that the statistics of observable windowed events within large 
time windows is identical after the correspondence 

u u}{md) = u}q{md) , n n{md) = ^'^^^^^ ^ (91) 

7-1 
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to the statistics of the total number of windowed events. The effective branching rate n{md) defined 
in (91) coincides, using the expression (84) for n{md) and (82) for qlmd), with expression (47) that 
we have previously obtained for the effective rate of observable aftershocks. This shows again that 
the intuitive probabilistic reasoning of section II E on effective aftershock rates is equivalent to the 
hnear approximation (87) for the GPF. As we are going to probe in greater depth, this suggests 
that the self-similar property (90) is a resilient and general feature of the ETAS model. 

D. Distribution of the number of observable events 

We now derive the consequences of the above results for the distribution of the numbers of 
observable events. 

Let us denote by V{r;md) the probability corresponding to the GPF Q{z;md) defined by (8), 
that there are r observable events in a given observable cluster. Similarly, we denote V{r; md) the 
distribution of the numbers of observable events within an arbitrary cluster corresponding to the 
GPF Q{z]md). The two GPF Q{z;md) and Q{z]md) are linked through equation (7). It follows 
from (7) and (88) that, within the domain of application of the linear approximation (87), these 
two probabilities can be expressed in terms of the probability V{r\K) of the total number of events 
of an arbitrary cluster via the following self-similar relations 

V{r;md) ~ V{r\K,{md)) , V{r;md) ~ q{md)V{r\K,{md)) , r ^ 1 . (92) 

Thus, the statistical properties of observable events are known from those of the all events via the 
scaling relations (92) (within the linear approximation (87) of the GPF). The self-similar properties 
(90) and (92) mean that the ETAS model is renormalized onto itself under the transformation 
mo nid, with just a renormalization from k to K{md) and, as a consequence, a renormalization 
of the branching ratio from n to n{md)- Our present analysis thus confirms for the full statistical 
properties the results obtained previously, based solely on the average seismic rates [48]. 

The statistics properties of the total number of events in individual aftershock clusters (without 
the constraint of observability) has been derived in our previous paper [35, 35]. Therefore, we just 
need to recall briefly some of its key properties which are useful for understanding the statistics of 
observable events. 
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Recall that the probability density V{r\K) is given by the Cauchy integral 

p(.|.) = _i_/l!^, (93) 
c 

where C is sufficiently small contour enveloping the origin z = 0, and Q{z\k,) is solution of the 
functional equation (68). The main difficulty in the calculation of the integral (93) is that the GPF 
Q{z\k) is defined only implicitly, via the solution of equation (68). To overcome this obstacle, we 
perform a change of variable and use the new integration variable y — Q{z\k). It follows from (68) 
that 

y 



Gi{y\K) 

which yields the explicit integral for V{r\K): 



(94) 



c ^ 

where C is some small contour in the complex plane y enveloping the origin y = 0. 

It is interesting to point out that relation (95) has an intuitive probabilistic interpretation, as 
it can be expressed as 



p(rk) = -Pr(y^ = r- 1) , (96) 
r 



where 



Ys = Y.^k. (97) 

fc=i 

and {f/i, f/2) • • • } ai'e mutually independent random integers with GPF G\(^z\k) given by (28) with 
(31). In the relevant regime for earthquakes for which, probably, 1 < 7 < 2, and for r ^ 1, the 
PDF of the sum (97) tends asymptotically to 

= = RIT^'^ (r^) ■ " = " ■ 

where i^{x) is the stable Levy distribution such that its two-sided Laplace transform is equal to 

/oo 
£^(a;)e-"^dx = e"^ . (99) 
-00 

This Levy distribution has the following properties 

£,(x)~^ (x-00), ^.(0) = \ 1/^) • M 
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One can calculate l^{x) numerically for any value 1 < 7 < 2 using, for instance, the following 
integral representation 



1 



TT Jo 



exp 



—V? + UX COS 



.7, 



sm 



/ TT \ vr 
MX sin — H — 
\1 7 



du 



The asymptotic expression for the probability (96) corresponding to (98) is 

1 , / r(l — n) — l\ 



V{r\K) 



(r > 1) . 



(101) 



(102) 



r(i/r)V7 \^ (z/r)V7 j 

When the average branching ratio n defined by (18) is close to l,Eq. (102) with (100) predicts the 
existence of two characteristic power laws for the probability 'P{r\n): 



V{r\K) 



,-1-1/7 



and 



where 



V{r\K) ~ r 



-1-7 



r — V 



1/(7-1) 



(r < r*) , 

(r > r*) , 

7/(7-1) 



1 -n 



(103) 



(104) 



(105) 



The power law (104) reflects the intrinsic distribution of the number of first-generation aftershocks 
given by relation (33), while the heavier power law tail (103) reflects the effects of cascades over 
many generations in the branching aftershocks triggering process [35] . See figure 2 in Ref . [35] for 
a visualization of the two power laws (103) and (104) and their cross-over. 

Then, substituting in (105) the effective branching rate n{md) for observable clusters, we obtain 
the dependence of the cross-over value r*{md) separating the two power laws (103) and (104) for 
the statistics of the number of observable events, as a function of the threshold magnitude — mo- 
Figure 7 shows r*{md) as a function of — niQ for 7 = 1.25 and several values of n. One can 
observe a fast decrease of r*{md) with — mo, which implies that increasing the observation 
magnitude threshold rud amounts to deviate more and more from criticality, as shown also directly 
in Figure 2. 



E. Deviations from self-similarity 



All above results on the self-similarity of the statistics of observable events expressed by relations 

(90) and (92) can be viewed as the consequence of the linear approximation (87). It is thus 
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important to explore how strong can be the deviations from self-similarity resulting from the 
properties of the exact equation (83) for the GPF 9 of the number of observable events within an 
observable cluster. In this goal, we rewrite equation (83) in the form 

e^zGi{e;md) , (106) 

where 

Gi{z;md) = Gi{z\K{md)g{z;md) , (107) 

and 

9{z;md) = / (108) 

One can interpret (106) and (107) as describing some branching process such that the GPF of the 
number of first-generation aftershocks is equal to Gilzinid)- In other words, the random number 
Ri{md) of first-generation aftershocks in this new branching model is equal to the sum of the two 
statistically independent random integers 

Ri {rrid) = U{md) + V{md) , (109) 

where U{md) has the self-similar GPF Gi{z\K,{md), while V{md) has the GPF g{z;md). 

Using (95), (96) and (107), we immediately obtain the exact integral representation of the PDF 
of the number of observable events in an observable cluster: 

V{r;md) = ^ f G[[z\K{mdW{y;md)^ . (110) 

Its corresponding probabilistic representation reads 

V{r;md)^-Pr(Yr^r-l) , (111) 
r 

where 

Ys^i2iUk + Vk) . (112) 

k=l 

The random integers {C/i, t/2, ■ ■ ■ } have the GPF GilzlKlrud)], while the random numbers 
{^1, V2, . . . } are random integers which are mutually statistically independent from each other 
(and from C/'s) with the GPF g{y;md) given by (108). As the transformation mo — > irid is equiv- 
alent to a decimation step in the language of the renormalization group (see [51] and [50] as well 
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as Chapter 11 of [52] for pedagogical introductions), the random variables {Vi, V2, . . . } with their 
GPF g{y;mci) correspond to a new relevant direction (branching process different from ETAS) in 
the space of branching processes. 

To obtain the statistical properties of the random integers V, consider the quadratic approxi- 
mation of the function (p{x;m.d) defined by (85): 

(fi{x; nid) ^ (fi2{x; nid) , (f2{x; nid) = x + ri{md)x{l - x) . (113) 

This second-order approximation is again consistent with the identities (86). Here, 

77(mrf) = 4Ai(l/2;mrf) , (114) 

where Ai is defined in (89). Figure 8 shows the difference 

^2{x] nid) = ^{.x] nid) - ^2{x; nid) , (115) 

as a function of x for different values of — mo for the same parameters as for Figure 6. 7 = 1.1 
and 1.25. Comparison between Figures 6 and 8 demonstrate the large improvement from the linear 
to the quadratic approximation (113). 

Substituting (113) into (108) yields the approximate GPF of the auxiliary random integers V 

as 

g{z;md) = — — -^-7^ r. (116) 

1 +V{^d){l -z) 

This GPF means that V has a geometric distribution with the following average and variance 

{V)=r^, al = r,{r, + l). (117) 

Thus, if 77 <^ 1, the random variable V has a small impact on the statistics of the number of 
observable events. In this case, we obtain the leading asymptotical contribution of the variable 

V to the statistics of observable events by using a power law expansion for the GPF Gi{z]nid)i 
similar to (31): 

G^{z;md) ^l + h{nid){l-z)+(5K\nid){l-z)^ , (118) 

where 

h{nid) = n{nid) + rj{nid) ■ (119) 
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Expression (118) shows that the main contribution of the random integer V resuhing from the 
first-order correction to the hnear approximation (87) is to introduce a small shift (for r] <^ 1) 
equal to r]{mij) to the effective branching rate n{mci) obtained within the linear approximation 
(87). Figure 9 shows the dependence of this shift r]{md) as a function of — mo for different 
values of 7. Since n{md) is typically in the range 0.5 — 1, this shows that the corrections are no 
more than about 10% in the value of the effective branching rate for observable events. 

IV. CONCLUSION 

We have shown that, to a good approximation, the ETAS model is renormalized onto itself 
under what amounts to a decimation procedure mo — > m^, with just a renormalization of the 
branching ratio from n to an effective value n{md)- Our present analysis thus confirms, for the 
full statistical properties, the results obtained previously in Ref. [48], based solely on the average 
seismic rates (the first-order moment of the statistics). However, our analysis also demonstrates 
that this renormalization is not exact, as there are small corrections which can be systematically 
calculated, in terms of additional contributions that can be mapped onto a different branching 
model (a new relevant direction in the language of the renormalization group). However, for prac- 
tical apphcations, due to the strong stochasticity of the ETAS branching model, these deviations 
from exact self-similarity will be difficult to observe. This justifies the standard procedure in sta- 
tistical parameter estimations of using the ETAS model with magnitude cut-off nid even if is 
an artificial detection threshold with no physical meaning for the triggering process. However, our 
results, which confirm by and large the conclusions of Ref. [48], show that the values of the branch- 
ing ratio (or average rate of generation of first-generation aftershocks) recovered by such statistical 
estimations is not the "true" one, but a effective or renormalized value. Thus, conclusions of the 
properties of aftershock clusters has to be re-examined in this light: echoing the main conclusion 
of Ref. [48], " previous estimates of the clustering characteristics of seismicity may significantly 
underestimate the true values." 
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Fig. 1: Dependence of the PDF's {r\K,md) and 7']~(r|K, m^) as a function of number r 
for 7 = 1.1, n = 0.9 and for fi{mc[) = 10 and 50, illustrating the presence of a power law tail 
~ r^^^^ for first-generation aftershocks triggered by observable sources and of fast decaying 
tails for first-generation aftershocks triggered by unobservable sources (of magnitude less than 
rrid). 
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Fig. 3: Dependence of effective rate n{md) given by (47) and (54) as a function of the 
branching ratio n of ah first-generation events, for a = 0.8 and 6=1 and several values of 
f^d — mo: rrid — tuq = 1; 2; 3; 4 from bottom to top. 
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Fig. 4: Dependence of the rates n^{md) quantifying the relative impact of aftershocks 
triggered by observable ("'") versus unobservable (~) events, as a function of rrid — mo, for 
a = 0.8, & = 1, and n = 0.9. 
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Fig. 5: Dependence of the ratio of the approximation (82) divided by the numerical solution 
of the exact equation (78), as a function of m^i — niQ, demonstrating the good accuracy of the 
approximate expression (82), for n = 0.9, b = 1: a = 0.7; 0.8; 0.9 from top to bottom. 
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Fig. 6: Dependence of the difference Ai(x;mrf) given by (89) as a function of the variable 
X, for n = 0.9,7 = 1-25 and several values of m^^ — rriQ = 1;2;3;4 (four upper curves from 
top to bottom). The group of almost undistinguishable curves at the bottom of the graph 
corresponds to n = 0.9, rud — rriQ = 1; 2; 3; 4 and 7 = 1.1. 
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Fig. 7: Dependence of the cross-over value r*{md) separating the two power laws (103) and 
(104) for the statistics of the number of observable events, for 7 = 1.25 and n = 0.9; 0.8; 0.7 
(top to bottom). 
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Fig. 8: Dependence of the difference A2(a;; m^) defined in (f 15) as a function of x for the same 
parameters as in Figure 6, demonstrating the high accuracy of the quadratic approximation 
(113). 
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Fig. 9: Dependence of the shift rj^m^) to the effective branching rate for observable events 
as a function of — mo for different values of 7 and for n = 0.9. 



41 



